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I Abstract. In this paper, we obtain complete characterization for nested canalyzing functions 

04 . (NCFs) by obtaining its unique algebraic normal form (polynomial form). We introduce a new 

^ ' concept, LAYER NUMBER for NCF. Based on this, we obtain explicit formulas for the the 

^ , following important parameters; 1) Number of all the nested canalyzing functions, 2) Number 

of aU the NCFs with given LAYER NUMBER, 3) Hamming weight of any NCF, 4) The activity 
number of any variable of any NCF, 5) The average sensitivity of any NCF. Based on these 
, formulas, we show the activity number is greater for those variables in out layer and equal in the 

same layer. We show the average sensitivity attains minimal value when the NCF has only one 
layer. We also prove the average sensitivity for any NCF (No matter how many variables it has) 
is between and 2. Hence, theoretically, we show why NCF is stable since a random Boolean 
function has average sensitivity ^. Finally we conjecture that the NCF attain the maximal 
average sensitivity if it has the maximal LAYER NUMBER n — 1. Hence, we guess the uniform 
^ upper bound for the average sensitivity of any NCF can be reduced to | which is tight. 

^ ! 1. Introduction 

!>■ 

Canalyzing function were introduced by Kauffman [19] as appropriate rules in Boolean net- 
work models or gene regulatory networks. Canalyzing functions are known to have other impor- 
tant applications in physics, engineering and biology. In [30] it was shown that the dynamics of 
a Boolean network which operates according to canalyzing rules is robust with regard to small 
perturbations. In [TH], W. Just, I. Shmulevich and J. Konvalina derived an exact formula for the 
number of canalyzing functions. In [28], the definition of canalyzing functions was generalized to 
any finite fields Fg, where g is a power of a prime. Both the exact formulas and the asymptotes 
of the number of the generalized canalyzing functions were obtained. 

Nested Canalyzing Functions (NCFs) were introduced recently in [20]. One important charac- 
teristic of (nested) canalyzing functions is that they exhibit a stabilizing effect on the dynamics 
of a system. That is, small perturbations of an initial state should not grow in time and must 
eventually end up in the same attractor of the initial state. The stability is typically measured 
using so-called Derrida plots which monitor the Hamming distance between a random initial 
state and its perturbed state as both evolve over time. If the Hamming distance decreases over 
time, the system is considered stable. The slope of the Derrida curve is used as a numerical 
measure of stability. Roughly speaking, the phase space of a stable system has few components 
and the limit cycle of each component is short. 

In |21] , the authors studied the dynamics of nested canalyzing Boolean networks over a variety 
of dependency graphs. That is, for a given random graph on n nodes, where the in-degree of 
each node is chosen at random between and k, where k < n, a nested canalyzing function is 
assigned to each node in terms of the in-degree variables of that node. The dynamics of these 
networks were then analyzed and the stability measured using Derrida plots. It is shown that 
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nested canalyzing networks are remarkably stable regardless of the in-degree distribution and 
that the stability increases as the average number of inputs of each node increases. 

An extensive analysis of available biological data on gene regulations (about 150 genes) showed 
that 139 of them are regulated by canalyzing functions [13]. In [211 133| . it was shown that 133 
of the 139 are in fact nested canalyzing. 

Most published molecular networks are given in the form of a wiring diagram, or dependency 
graph, constructed from experiments and prior published knowledge. However, for most of 
the molecular species in the network, little knowledge, if any, could be deduced about their 
regulatory mechanisms, for instance in the gene transcription networks in yeast [TJj and E. Coli 
[3]. Each one of these networks contains more than 1000 genes. Kauffman et. al [20] investigated 
the effect of the topology of a sub-network of the yeast transcriptional network where many of 
the transcriptional rules are not known. They generated ensembles of different models where 
all models have the same dependency graph. Their heuristic results imply that the dynamics of 
those models which used only nested canalyzing functions were far more stable than the randomly 
generated models. Since it is already established that the yeast transcriptional network is stable, 
this suggests that the unknown interaction rules are very likely nested canalyzing functions. In 
a recent article [2], the whole transcriptional network of yeast, which has 3459 genes as well 
as the transcriptional networks of E. Coli (1481 genes) and B. subtillis (840 genes) have been 
analyzed in a similar fashion, with similar findings. 

These heuristic and statistical results show that the class of nested canalyzing functions is 
very important in systems biology. It is shown in [15] that this class is identical to the class 
of so-called unate cascade Boolean functions, which has been studied extensively in engineering 
and computer science. It was shown in [8] that this class produces the binary decision diagrams 
with the shortest average path length. Thus, a more detailed mathematical study of this class 
of functions has applications to problems in engineering as well. 

In [16], the authors provided a description of nested canalyzing function. As a corollary of the 
equivalence, a formula in the literature for the number of unate cascade functions also provides 
such a formula the number of nested canalyzing functions. Recently, in [32], those results were 
generalized to the multi-state nested canalyzing functions on finite fields ¥p, where p is a prime. 
They obtained the formula for the number of the generalized NCFs, as a recursive relation. 

In [12], Cook et al. introduced the notion of sensitivity as a combinatorial measure for Boolean 
functions providing lower bounds on the time needed by CREW PRAM (concurrent read , but 
exclusive write (CREW) parallel random access machine (PRAM)). It was extended by Nisan 
|34j to block sensitivity. It is still open whether sensitivity and block sensitivity are polynomially 
related (they are equal for monotone Boolean functions). Although the definition is straight- 
forward, the sensitivity is understood only for a few classes function. For monotone functions, 
Ilya Shmulevich [37] derived asymptotic formulas for a typical monotone Boolean functions. Re- 
cently, Shengyu Zhang [l3] find a formula for the average sensitivity of any monotone Boolean 
functions, hence, a tight bound is derived. In [38], Ilya Shmulevich and Stauart A. Kauffman 
considered the activities of the variables of Boolean functions with only one canalyzing variable. 
They obtained the average sensitivity of this kind of Boolean function. 

In this paper, we revisit the NCF, obtaining a more explicit characterization of the Boolean 
NCFs than those in [16]. We introduce a new concept, the LAYER NUMBER in order 
to classify all the variables. Hence, the dominance of the variable can be quantified. As a 
consequence, we obtain an explicit formula for the number of NCFs. Thus, a nonlinear recursive 
relation (the original formula) is solved, which maybe of independent mathematical interest. 
Using our unique algebraic normal form of NCF, for any NCF, we get the formula of activity 
for its variables. We show that the variables in a more dominant layer have greater activity 
number. Variables in the same layer have the same activity numbers. Consequently, we obtain 
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the formula of any NCF's average sensitivity, its lower bound is i^^rrr and its upper bound is 2 
(No matter what n is) which is much less than ^, the average sensitivity of a random Boolean 
function. So, theoretically, we proved why NCF is "stable". We also find the formula of the 
Hamming weight of each NCF. Finally, we conjecture that the NCF attains its maximal value 
if it has the maximal LAYER NUMBER n — 1. Hence, we guess the tight upper bound is |. In 
the next section, we introduce some definitions and notations. 

2. Preliminaries 

In this section we introduce the definitions and notations. Let F = F2 be the Galois field 
with 2 elements. If / is a n variable function from F"" to F, it is well known |29J that / can be 
expressed as a polynomial, called the algebraic normal form(ANF): 

/(xi,X2, . . . ,X„) = ^ ak^k2...k^Xi'^X2''^ ■ ■ -Xn" 

Q<ki<l,i=l,...,n 

where each coefficient akik2...k„ € F is a constant. The number ki+k2 + - ■ ■+kn is the multivariate 
degree of the term afcifc2...fc„^i'^^^2'^^ ■ ■ ■ 2;n'^" with nonzero coefficient afcifc2...fc„- The greatest 
degree of all the terms of / is called the algebraic degree, denoted by deg{f). 

Definition 2.1. f{xi,X2, • • • , is essential in variable Xi if there exist r, s G F and x^, . . . , x*_^ 
) ^i+i) ■ ■ ■ ,Xn such that /(x^ . . . , x*_^, r, x*_^_-^^, . . . , x^) ^ f{xi, . . . , x*_^, s, x*_^_-^^, . . . , x^). 

Definition 2.2. A function f{xi,X2, ■ ■ ■ ,x„) is < i : a : b > canalyzing if 

f{xi, . . . ,Xi^i,a,Xi+i, ... ,Xn) =b, for all xj, j / i, where i € {1, . . . ,n}, a,b € F. 

The definition is reminiscent of the concept of canalisation introduced by the geneticist C. H. 
Waddington [H] to represent the ability of a genotype to produce the same phenotype regardless 
of environmental variability. 

Definition 2.3. Let f be a Boolean function in n variables. Let a be a permutation on 
{1,2, . . . ,n}. The function f is nested canalyzing function (NCF) in the variable order 

Xo-(i), . . . , Xo-(„) with canalyzing input values ai, . . . , a„ and canalyzed values 61, . . . , 6„, if it 
can be represented in the form 



f {x\, . . . , Xn) — 



r 61 




= ai 








b2 




= oT 


Xa(2) 


= a2, 




< ^3 


Xa{l) 


= oT 


Xa{2) 


= 02, 


Xa{3) = «3> 


bn 


Xa{l) 


= ol 


Xa{2) 


= 02, 


• • • ) ^tT(n— 1) ~ 0,n—l: X„(^n) — O^rii 


, bn 


Xa{l) 


= oT 


Xa{2) 


= 02, 


• • • ) X(j{^fi—1) ~ O^n—li Xfj{n) ~ ^n- 



Where a = a © 1 . The function f is nested canalyzing if f is nested canalyzing in the variable 
order Xo-{i), . . . , Xo-(n) for some permutation a. 

Let a = (oi, a2, . . . , a^) and /3 = (61, 62, • • • , bn), we say f is {a : a : (3} NCF if it is NCF in 
the variable order Xo-(i), . . . , x^f^n) with canalyzing input values a = (ai, . . . , On) and canalyzed 
values /3 = (61, ... , 6„). 

Given vector a = (ai, 02, . . . , an), we define a*^'"''** = (ai, . . . ,0^7, ■ ■ ■ , ol^, • • • , On) 

From the above definition, we immediately have the following 

Proposition 2.4. f is {a : a : (3} NCF / is {a : a" : /?"} NCF 

Example 2.5. /(xi,X2,X3) = xi(x2 © l)x3 © 1 is {(1,2,3) : (0,1,0) : (1,1,1)} NCF. 
Actually, one can check this function is nested canalyzing in any variable order. 
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Example 2.6. f{xi,X2,X3) = (xi © 1)(3;2(2;3 © 1) © 1) © 1. This function is 
{(1, 2, 3) : (1,0, 1) : (1,0, 0)} NCF. It is also {(1, 3, 2) : (1, 1, 1) : (1, 0, 1)} NCF. 

One can check this function can be nested canalyzing in only two variable orders {xi, X2, x^) 
and (xi, X3, X2). 

From the above definitions, we know a function is NCF, all the n variable must be essential. 
However, a constant function b can be < i : a : 6 > canalyzing for any i and a. 

3. A Complete Characterization for NCF 

In [25], the author introduced Partially Nested Canalyzing Functions (PNCFs), a general- 
ization of the NCFs, and the nested canalyzing depth, which measures the extent to which it 
retains a nested canalyzing structure. In |17j . the author introduced the extended monomial 
system. 

As we will see, in a Nested Canalyzing Function, some variables are more dominant than the 
others. We will classify all the variables of a NCF into different levels according to the extent 
of their dominance. Hence, we will give description about NCF with more detail. Actually, we 
will obtain clearer description about NCF by introducing a new concept: LAYER NUMBER. 
As a by-product, we also obtain some enumeration results. Eventually, we will find an explicit 
formula of the number of all the NCFs. 

First, we have 

Definition 3.1. jl7j M{xi, . . . , x„) is an extended monomial of essential variables xi, . . . ,Xn if 

M{xi, . . . ,Xn) = {xi © ai){x2 © a2)...{xn © an), where ai € F2. 

Basically, we will rewrite Theorem 3.1 in [16] with more information. 
Lemma 3.2. f{xi,X2, ■■■Xn) is < i : a : b > canalyzing iff 

f{X) = f{xi,X2, Xn) = (Xi © a)Q{xi, Xi_i, Xj+i ...Xn)®b. 

Proof. From the algebraic normal form of /, we rewrite it as / = Xigi{Xi) © gQ{Xi), where 
Xi = (xi, . . . ,Xi_i,Xi+i, . . . ,x„). Hence, f{X) = /(xi,X2, . . . ,Xn) = {xi®a)gi{Xi)@agi{Xi) © 
goiXi). Let gi{Xi) = Q{xi, . . . ,Xi_i,Xi+i . . . x^) and r(Xj) = agi{Xi) ® go{Xi). Then /(X) = 

/(Xi, . . . ,Xn) = (xi © a)Q{xi, . . . ,Xi_i,Xi+i . . .Xn) © r{Xi) 

Since f{X) is < i : a : b > canalyzing, we get f{X) = /(xi, ...Xj_i, a, Xj+i, . . . ,Xn) = b for 
any xi, . . . , Xj_i, Xj+i . . . x^, i.e., r{Xi) = b for any Xi. So r{Xi) must be the constant b. We 
finished the necessity. The sufficiency is obvious. □ 

Remark 3.3. 1) When we contrast this lemma to the first part of Theorem 3.1 in ^^,we make 
clear that here, the Xj is not essential in Q. 2) In p8], there is a general version of this Lemma 
over any finite fields. 3) In the above lemma, if f is constant, then (5 = 0. 

From Definition 12.31 we have the following 

Proposition 3.4. // /(xi, . . . , x^) is {a : a : /?} NCF, i.e., if it is NCF in the variable order 

Xo-(i), . . . , Xo-(„) with canalyzing input values a = (oi,...,an) and canalyzed values (3 = 
(61,..., 6„). 

Then, for 1 < k < n — 1, let Xo-(i) = oT, . . . , a;(T(fc) = then the function 

a(l) , r 

/(xi, . . . , ai , . . . , , . . . , Xn) is {cr* : a* : /?*} NCF on those remaining variables, where 
cr* = x^(fc+i), . . . ,x^(„), a* = (afc+i, • • • ,a„) and {3* = (6fc+i, • • -^K)- 

Definition 3.5. ///(xi, . . . ,x„) is a NCF. We call variable Xj the most dominant variable of 
f , if there is an order a = (x,, . . .) such that f is NCF with this variable order (In other words, 
if f is also < i : a : b > canalyzing for some a and b). 
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In Example 12. 5| all the three variables are most dominant, in Example 12.61 only xi is the 
most dominant variable. We have 

Theorem 3.1. Given NCF f{xi, . . . all the variables are most dominant iff 

f = M(xi, . . . , Xn) © b, where M is an extended monomial, i.e., 
M = (xi ai){x2e a2).:{x 

n ffi O-n ) ■ 

Proof, xi is the most dominant, from Lemma 13.21 we know there exist ai and b such that 

f{xi,X2, . . . ,Xn) = {xi (Bai)Q{x2, . . . ,Xn) (Bb, i.e., {xi ©ai)|(/©6). Now, X2 is also the most 
dominant, we have 02 and b' such that 

f{xi, a2, ^3, . . . , Xn) = b' for any rci, 2:3, . . . , Xn- Specifically, let x\ = ai, we get 
f{ai,a2,xs, . . . ,Xn) = b = b'. Hence, we also get (x2 © 02)|(/ ®b) = (xi © ai)Q{x2, ■ ■ ■ ,Xn), 
since xi © ai and X2 © 02 are coprime, we get {x2 © a2)\Q{x2, ■ ■ ■ , hence, f{xi,X2, ■ ■ ■ , x^) = 
{xi © ai)(x2 © a2)Q'{x^, . . . ,Xn) © b. With induction principle, the necessity is proved. The 
sufficiency if evident. □ 

We are ready to prove the following main result of this section. 

Theorem 3.2. Given n >2, f{xi,X2, ■ ■ ■ ,Xn) is nested canalyzing iff it can be uniquely written 
as 

f{xi,X2, ...,Xn) = Mi(M2(. . . {Mr-l{Mr © 1) © 1) . . .) © 1) © 6. (3.1) 

Where each Mi is an extended monomial of a set of disjoint variables. More precisely. Mi = 
Ylf=i{xi. ® ai.), i = 1,. . . ,r, ki > 1 for i = 1,. . . ,r - 1, kr > 2, ki ® . . . ® kr = n, Qij £ ¥2, 
{ij\j = l,...,ki,i = l,...,r} = {l,...,n}. 

Proof. We use induction on n. 

When n = 2, there are 16 boolean functions, 8 of them are NCFs, Namely 

{xi © ai){x2 © a2) © c = Ml © 1 © 6, where 6 = 1 © c and Mi = {xi © ai){x2 © 02)- 

If (xi ©ai)(x2 ©02) ©c = (xi ©ai')(x2 ©a2)'©c', by equating the coefficients, we immediately 

obtain ai = oi', 02 = 02' and c = c'. So, uniqueness is true. 

We have proved that equation 13.11 is true for n = 2, where r = 1. 

Let's assume that equation 13.11 is true for any nested canalyzing function which has at most 
n — 1 essential variables. 

Now, consider NCF /(xi, . . . , x„). 

Suppose Xo-(i), . . . are all the most dominant canalyzing variables of /, 1 < A;i < n. 

Case 1: /ci = n, by Theorem 13. H the conclusion is true with r = 1. 

Case 2: ki < n, with the same arguments to Theorem 13.1} we can get / = Mig © 6, where 
Ml = (X(^(i) © a<^(i)) . . . (x^(fc) © aCT(fc)). Let x^(i) = a^^, x^(fc) = in /, the function 

g (B b, hence, g, of the remaining variables will also be nested canalyzing by Proposition 13.41 

Since g has n — ki < n — 1 variables, by induction assumption, we get 

g = M2{M^{. . . {Mr-i{Mr © 1) © 1) . . .) © 1) © 61, at this time, 61 must be 1. Otherwise, all 

the variables in M2 will also be the most dominant variables of /. Hence, we are done. □ 

Because each NCF can be uniquely written as 13.11 and the number r is uniquely determined 
by /, we have 

Definition 3.6. For a NCF written as equation \3.1l the number r will be called its LAYER 
NUMBER. Essential variables of Mi will be called the most dominant variables (canalyzing vari- 
able), they belong to the first layer of this NCF. Essential variables of M2 will be called the 
second most dominant variables and belong to the second layer of this NGF and etc. 

The function in example 12.51 has LAYER NUMBER 1 and the function in example 12.61 has 
LAYER NUMBER 2. 
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Remark 3.7. In Theorem \3.2l 1) kr > 2. It is impossible that kj. = 1. Otherwise, Mr © 1 will 
be a factor of M^-i which means LAYER NUMBER is r — 1. 2) If variable Xi is in the first 
layer, and factor of Mi, then this NCF is < i : Oi : b > canalyzing, we simply say Xi 

is a canalyzing variable of this NCF. 

Let NCF(n, r) stands for the set of all the n variable nested canalyzing functions with LAYER 
NUMBER r and NCF(n) stands for the set of all the n variable nested canalyzing functions. 
We have 

Corollary 3.8. Given n > 2, 

|NCF(n,r)| = 2'^+i E C " J 



k\+...+kr=n 
k,>l,i=l,...,r-l,kr>2 



and 

ra-1 



incf(„)i=2"«j: y. u,.. ,,„.,) 



r=l k-i+...+kr=n 

k,>l,i=l,...,r-l,kr>2 

I 



Where the multinomial coefficient (^^ J = ^^"'^ 

Proof. From Equation 13.11 for each choice ki, . . . ,kr, with condition ki + . . . + kr = n, ki > 1, 
i = 1, . . . , r — 1 and kr > 2, 

there are 2^^! (^^) many ways to form Mi, 

there are 2^^(J^^^^^ many ways to form M2, 
. . . , 

there are (J^~''''-~j^'~''''^^^ many ways to form Mr, 

b has two choices. 

Hence, 



fcl+...+fcr=ra 
ki>l,i=l,...,r-l,kr>2 



n\ fn — ki\ fn — ki — . . . — kr-i 
k2 J ' \ kr 



|NCF(n,r)|=2 ^ 2'"^+-+''^ ("^^ 

,>2 

n! {n — ki)\ {n — ki — . . . — kr--\)\ 



-)ra+l 



E 



(*^i)!(n - fci)! (A;2)l(n - *i - fa)! tK" - fci - ■ ■ ■ - 

ki>l,i=l,...,r-l,kr>2 

fcl+...+fcr=n k\+...+kr=n 
k^>l,i=l,...,r-l,kr>2 ki>l,i=l,...,r-l,kr>2 

Since NCF(n) = Ur=i NCF(n,r) and NCF(n, i) fl NCF(n, j) = when i / j, we get the 
formula of I NCF(n)j. □ 

One can check that |NCF(2)| = 8, |NCF(3)| = 64, |NCF(4)| = 736, |NCF(5)| = 10624,... 
These results are consistent with those in [H |36] . 

By equating our formula to the recursive relation in [H [36] , we have the following 
Corollary 3.9. The solution of the nonlinear recursive sequence 

n-l 



r=2 ^ 



NESTED CANALYZING FUNCTIONS AND THEIR AVERAGE SENSITIVITIES 



7 



IS 

n—1 / 

r=l fci+...+fcr=n ^ • • • ' 
k{>\,i=\,...,r-\,kr>2 

4. Activity, Sensitivity and Hamming Weight 

A Boolean function is balanced if exactly half of its value is zero. Equivalently, the Hamming 
weight of this n variables Boolean function is 2"~^. There are (2^-1) balanced functions . It 
is easy to show that a Boolean functions with canalyzing variables is not balanced, i.e., biased. 
Actually, very biased. For example. Two constant functions are trivially canalyzing. They are 
the most biased. Extended monomial functions are the second most biased since for any of 
them, only one value is nonzero. But biased functions may have no canalyzing variables. For 
example, /(xi,X2,X3) = x\X2Xj,®x\X2®x\Xj,®X2X'i is biased but without canalyzing variables. 

In Boolean functions, some variable have greater influence over the output of the function than 
other variables. To formalize this, a concept called activity was introduced. Let ^'^^^g^'. '^"^ = 
/(xi, . . . , Xj © 1, . . . , Xn) /(xi, . . . , Xj, . . . , x„). The activity of variable Xj of / is defined as 

* 2" ^ dXi ^ ^ 

(xi,...,a:„)GF5 

Note, the above definition can also be written as the following 

a^^ = ^ Yl (/(xi,...,0,...,x„)e/(xi,...,l,...,x„)) (4.2) 

The activity of any variables of constant functions is 0. For affine function /(xi, . . . ,x„) = 
xi © . . . ® Xn © 6, af = 1 for any i. It is clear, for any / and i, we have < a{ < 1. 

Another important quantity is the sensitivity of a Boolean function, which measures how 
sensitive the output of the function is if the input changes (This was introduced in [12]). The 
sensitivity s-^(xi, . . . , x„) of / on vector (xi,...,x„) is defined as the number of Hamming 
neighbors of (xi, . . . , x„) on which the function value is different from /(xi, . . . , x„). That is. 



s 



^(xi,...,Xn) = |{i|/(xi,...,0, ...,x„) //(xi,...,l,...,x„),z = l,...,n}|. 



Obviously, ./(xi, . . . , x„) = Er=i ^%f^ 
The average sensitivity of function / is defined as 



n 



s-l^ = E[s^Xi,...,Xn)] = ^ Yj s^(xi,...,x„) = ^af. 

ixi,...,Xn)e¥l^ i=l 

It is clear that < S'^ < n. 

The average sensitivity is one of the most studied concepts in the analysis of Boolean functions. 
Recently, It receives a lot of attention. SeePI^EllTllinilllllSSllMllMlESlEZllMlEHll^ 
Bernasconi [S] has showed that a random Boolean function has average sensitivity ^. It means 
the average value of the average sensitivities of all the n variables Boolean functions is ^ . In [38] , 
Ilya Shmulevich and Stuart A. Kauffman calculated the activity of all the variables of a Boolean 
functions with exactly one canalyzing variable and unbiased input for the other variable. Add 
all the activities, the average sensitivity of this function was also obtained. 

In the following, using Equation 13. 1|, we will obtain the formula of the Hamming weight of 
any NCF, the activities of all the variables of any NCF and the average sensitivity (which is 
bounded by constant) of any NCF. 
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First, we have 

Lemma 4.1. (xi © oi) . . . (x^ © a^) = | g' ^otherwise^ ~ " ' '"'''"^ i.e., only one value is 
1 and all the other 2^ — 1 values are 0. 

Theorem 4.1. Given n > 2. Let fi = Mi, fr = Mi(Af2(. . . {Mr-i{Mr © 1) © 1) . . .) © 1) , 

r > 2, where Mi is same as that in the in Theorem \3.SX Then the Hamming weight of fr is 

r 

W{fr) = ^(-iy-i2"-S-i (4.3) 
The Hamming weight of fr (Bl is 

r 

W{fr © 1) = ^(-l)-'2"-^-i (4.4) 
i=o 

Where ^1^=1 should be explained as 0. 

Proof. First, let's consider the Hamming weight of /,.. 
When r = 1, we know the result is true by Lemma |4.1[ 
When r > 1, we consider two cases: 
Case A: r is odd, r = 2t + 1. 

All the vectors make / = 1 will be divided into the following disjoint groups. 

Group 1: Mi = 1, M2 = 0; 

Group 2: Mi = 1, M2 = 1, M3 = 1, M4 = 0; 

Group j: Mi = 1, M2 = 1, . . ., Maj-i = 1, Maj = 0; 

Group t : Ml = 1, M2 = 1, . . ., M2t_i = 1, Mat = 0; 

Group t + 1 : Ml = 1, M2 = 1, . . ., M2t = 1, M2i+i = M^ = L 

In Group 1, the number of vectors is (2*^^ _ i^2'^-ki-k2 ^ 2"-'^i - 2'^-^i-^2. 

In Group 2, the number of vector is {2^* - I)2'^~ki~k2~ks~k4 ^ 2"-fci~'=2-fc3 _ 2n-fci-fc2-fc3-fc4_ 

In Group t, the number of vector is {2''^' - i)2"'-ki---k2t = 2"-'=i---'=2*-i - 2"-'^i---'=2t. 
In Group t + 1, the number of vectors is 2"~'^i^- -^'^'' = 1. 
Add all of them, we get the formula Equation 14.31 
Case B: r is even, r = 2t. 

All the vectors make / = 1 will be divided into the following disjoint groups. 

Group 1: Mi = 1, M2 = 0; 

Group 2: Mi = 1, M2 = 1, M3 = 1, M4 = 0; 

Group j: Mi = 1, M2 = 1, . . ., Ma^-i = 1, M2j = 0; 

Group t-1 : Ml = 1, M2 = 1, . . ., M2t-3 = 1, M2j-2 = 0; 

Group t : Ml = 1, M2 = 1, . . ., M2i_i = 1, Mat = M^ = 0. 

In Group 1, the number of vectors is (2*^^ - i)2"-'=i-'=2 = 2'"-'=i - 2"-^i-'=2_ 

In Group 2, the number of vector is (2'^^ _ i)2"-'=i-'^2-fc3-fc4 = 2"-'^i-^2-fc3 _ 2«-fci-fc2-fe3-fc4_ 

In Group t - 1, the number is {2^^*-^ - i)2''^-ki---k2t-2 = 2"-'=i---'=2t-3 _ 2«-fci---fc2t-2_ 
In Group t, the number of vectors is 2"-'=i---'=2t-i _ 2^~ki~...~k2t ^ 2''2t _ i 
Add all of them, we get the formula Equation 14.31 again. 
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Because |{(xi, . . . ,x.„)|/(xi, . . . = 0}| + \{{xi, . . . ...,Xn) = l}\ = 2", we know 

the Hamming weight of © 1 is 

r r 

W{fr © 1) = 2" - W{fr) = 2" - ^(-l)J-i2"-n=ifc» = ^(_i)J2""n=ifcv 

j=l j=0 

Where Yl^=i ^« should be explained as 0. □ 

In the following, we will calculate the activities of the variables of any NCF. 

Let / be a NCF and written as the form in Theorem 13.21 Without loss of generality(to 
avoid the complicated notation), we assume Mi = {xi © ai){x2 © 02) • • • {xki © Ofci) and mi = 
(xi © oi) . . . {xi-i © ai-i){xi+i © aj+i) . . . (xf,^ © Ofci), i-e., Mi = (xj © Oi)mi. 

If r = 1, i.e., ki = n, then 

O^i = XI (/(Xi,...,0, ©/(xi,...,l,...,Xn)) 

{xi,...,Xi-i,Xi+i,...,x„)eF2^^ 

= ^ E ™1 = ^^("^1) = 

(xi,...,Xi_i,Xi+i,...,x„)GF2"-'- 

by Lemma |4.1[ 
If 1 < r < n - 1, 

Let's consider the activity of Xj in the first layer, i.e., 1 < i < ki. We have 

2^ (/(Xi,. . . ,0, . . . ,Xn) © /(Xi, . . . , 1, . . . ,Xn)) 

(xi,...,Xi_i,a;i+i,...,a;„)eF2~-^ 

= ^ X] mi(M2(...(M,_i(M,©l)©l)...)©l). 

{xi,...,a;i_i,Xi+i,...,x„)GF2~-'- 

^ ^W^(mi(M2(. . . {Mr-l{Mr © 1) © 1) . . .) © 1)). 



2 

2^ E ■=i(-iy-'2-"i-(i:Li '^-D, ki > 1 



^E,'=i(-ir'2"-S'-'=% fci>l 



2n- 


1 


1 




2n- 


1 


1 




2n- 


1 


1 




nn- 


1 



A:i > 1 


A- 


/ci = 1. 




ki > 1 


_ 1 


ki = 1. 


2^-1 


ki = 


1 means 



^E-=o(-iy2"-ni'=^ ki = i. 
^E-=i(-ip"'2"-J:^^i'=- 

by Theorem 14.1 [ Note, in the above, ki = 1 means mi = 1, we used the Equation 14.41 with 
layer number r — 1 and the first layer is M2 for n — 1 variables functions. 

Now let's consider the variables in the second layer, i.e., Xj is an essential variable of M2. We 
have M2 = (xj + ai)m2 and 

«i = ^;i3T 2^ (/(Xi,. . . ,0, . . . ,Xn) © /(xi, . . . , 1, . . . ,X„)) 



1 



{x-^,...,Xi-.1,Xi+-^,...,Xn)&2^^ 



Afl(m2(. . . {Mr-l{Mr © 1) © 1) . . .)). 
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1 



>n— 1 



^ Mim2(. ..(M^_i(M^ei)ei)...)- 



{xi,...,Xi-l,Xi + i,...,Xn)&2 

'I — 1 , r— 1 



in— 1 Z_-^^ ' on— 1 Z_-^^ 



-1 "'I 

2n-l - ^ 

i=i i=i 



by Equation 14.31 in Theorem 14.11 Note, Mim2 is the first layer, M3 is the second layer and 
etc. 

Now let's consider the variables in the Ith layer, i.e., Xi is an essential variable of Mi, 2 < I < 
r — 1. We have M; = (xj + ai)mi and 

«i = 2^ (/(xi,. . . ,0, . . . ,x„) e /(xi, . . . , 1, . . . ,x„)) 

{xi,...,Xi-l,Xi+-i_,...,X„)&'^~'^ 

^ Mi...M;_imz(Mi+i(...(M^®l)...)el). 



2n-l 



(xi,...,a:i_i,Xi+i,...,x„)GF2 
r— /+1 r— ^+1 

^ ^ / ]^y7-i2»-i-((fci+-+fc/.-i)+fc;-n+...+fc,4-i-i)) ^ ^ f_iy-i2"-n=^''=« 

2n— 1 / > ^ ' 2"^-*- ^ ■' 

i=i j=i 
by Equation 14.31 in Theorem 14.11 Note, Mi . . . Mi_imi is the first layer, M^+i is the second 
layer, and etc. 

Let Xi be the variable in the last layer Mr, we have 

^ ^ M1M2... Mr-irrir ^ 



by Lemma l4.ll 

Variables in the same layer have the same activities, so we use A\ to stand for the activity 
number of each variable in the Ith layer Mi, 1 < I < r. We find the formula of Aj for 2 < / < r — 1 
is also true when / = r or r = 1. Hence, we write all the above as the following 



Theorem 4.2. Let f be a NCF and written as in the Theorem [37 
then the activity of each variable in the Ith layer , I < I < r, is 



A' = ^ E(-l)^'"2''-^""'' (4.5) 
The average sensitivity of f is 



r r—l+1 

f 



E ki^l = ^ E E (-l)-^2"-nil- (4.6) 



' ' 2 

1=1 1=1 j=i 

We do some analysis about the formulas in Theorem 14.21 we have 
Corollary 4.2. n > 3, A( > > . . . > aI and ^ < < 2 - 
Proof. 

Af — f — l )i-l2"~S£l ^ = t2n-ki-...-ki _ 2n-ki-...-ki+i , (_lY-l\ 

I 2'"^-'^ ^ ■ ■ ■ \ ) I 
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Since the sum is an alternate decreasing sequence and ki^i > 1, we have 

2"— 1 — 2"'^i / J 2""-*^ 

Hence, 



We have 



1 .2n-fci-...-fc,+i) < 1 .2n-fci-...-fc,-U ^/ 
t+1 on-l — on-l ^ / ^ ( 



_ /2n-A:i _ 2n-fci-fc2 i 2"-fci-fo-A:3 _ 

1 2"^-*- • • • V 7 



2n-l 

Hence, = Yl'i=i hM > + gfer + • • • + 2^ = 2^' ^° ^'^^^ ^^^^ LAYER 

NUMBER 1 has the minimal average sensitivity. 

On the other hand, = ^[^^ kAj < ^T""^^ + ^T^^^^~^-^ + . . . + 2^2"-^'i--"^'' + 
• • • + gfe" = f^(^'ii • • • ) ^r), where /ci + . . . + fc^ = n, /cj > 1, i = 1, . . . , r — 1 and kr > 2. We will 
find the maximal value of U{ki, . . . , kr) in the following. 

First, we claim fc^ = 2 if U{ki, . . . , /c^) reach maximal value. Because if kr is increased by 1, 
and the last term makes ^^h^i more contributions to U{ki, . . . , kr), then there exists ki will be 
decreased by 1 {ki + . . . + kr = n), hence 



2""i 



n-l ' 



will be decreased more than 
Now, Look at 2"~^^, it is obvious it attains the maximal value only when A;i = 1 or 2 but 
obviously ki = 1 will be the choice since it also make all the other terms greater.. 

Now Look at ^"i^ 2'"~^i^^2^ attains the maximal value when /ci = A;2 = 1 or /ci = 1 and 
k2 = 2, again, A;2 = 1 is the best choice to make all the other terms greater. 

In general, \i ki = ... = ki-i = 1, then ^n-i 2'^~^^~'"~^' attains its maximal value when 
ki = 1, where 1 < / < r — 1. 

In summary, we have showed that U{ki, . . . ,kr) reaches maximal value when r = n — 1, 
ki = ... = kn-2 = 1, ^n-i = 2 and 

Max U{ki, ...,kr) = . . . , 1, 2) = 27^(2"-i ^ 2"-2 + . . . + 22 + 2) = 2 - □ 

Remark 4.3. So, we know the average sensitivity is bounded by constants for any NCF with 
any number of variables Since the minimal value approaches to and the maximal value of 
U{ki, . . . ,kr) approaches to 2 as n ^ 00. Hence, < s-^ < 2 for any NCF with arbitrary 
number of variables. 



In the following, we evaluate the formula Equation 14.61 for some parameters ki, . . . ,kr, we 
have 
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Lemma 4.4. 1) When r = n — 1, ki = . . . = kn-2 = 1; ^n-i = 2, s-^ = | — ^"j^x2"^ 

Given n > 4, r = n - 2, /ci = . . . = A;„_3 = 1, A:„_2 = 3, s-'' = | - — ; 

3) If n is even and n > 6, r = ^ , ki = 1, k2 = ■ ■ ■ = = 2, A;^. = 3, s-'^ = | — 33^- 

Hence, these three cardinalities are equal if n is even. 

Proof. When r = n — 1, ki = . . . = kn~2 = 1, ^n-i = 2 by Equation 14.61 We have 

«=i Z=l j=l 

^ n—l n—l—l ^ n—1 ^ ^ 

= ^ E E (-iy-^2"-^-'+i + i-ir-^-') = ^Y1 kiif""-'^' + 3 (-1)"-') 
1=1 j=i 1=1 

= J-(y (12"-'+' + -(-1)"-') + 2) = - - ^±t})l 

2n.-iVZ^^3 3^ ^ ^ ^3 3x2" 

1=1 

The other two formulas are also routine simplifications of Equation I4.6[ □ 

Based on our numerical calculation, Lemma 14.41 and the proof of Corollary 14.21 We have the 
following 

Conjecture 4.5. The maximal value of is s-' = ^ 33^2^' '"^^^^ reached if the NCF 

has the maximal LAYER NUMBERS n — l, i.e., if r = n — 1, ki = . . . = kn-2 = 1? ^n-i = 2. 
When n is even, this maximal value is also reached by NCF with parameters n> A, r = n — 2, 
ki = . . . = kn-2, = 1) kn-2 = 3 or ?i > 6, r = ^ , /ci = 1, A;2 = ... = ^^-i = 2 and k^ = 3. 

Remark 4.6. When n = 6, the NCF with ki = 1, k2 = 2, k^ = I and k^ = 2 also has the 
maximal average sensitivity |^ . But this can not be generalized. If the above conjecture is true, 
then we have < < | for any NCF with arbitrary number of variables. In other words, both 
and I are uniform tight bounds for any NCF. 

We point out, given the algebraic normal form of /, it is easy to find all of its canalyzing 
variables (the first layer Mi), then write / = Alig + b, repeating the schedule to g, we can easily 
to determine if / is NCF, if yes, we then write it as the form in Theorem 13.21 

We end this section by the following example. 

Example 4.7. Let N{xi,X2, 2:3, 2:4) = X1X2X3 © X2X3X4 © xix^ © X3X4 © 1 and 

Y{xi,X2,Xs,X4,X5) = XiX22;3X4X5©Xi2;2a^32;4©XiX2X4X5ffiXiX2a;4©XiX3X4©Xi3;3©XiX4©Xi. 

For A^(xi, X2, X3, X4), for all the 4 variables, we found when X2 = 1 or X3 = 0, then functions 
becomes constant 1, so we know N{xi,X2,xs,X4) = (x2 © l)(x3)iVi © 1. Actually, 

N{xi,X2,Xs,Xi) = X1X2XS © X2X3X4 © X1X3 © X3X4 © 1 = X3(xiX2 © X2X4 © Xi © X4) © 1 

= X3(x2(xi © X4) © xi © X4) © 1 = a;3((x2 © l){xi © X4)) © 1. Since xi © X4 has no canalyzing 
variable, we know N is not NCF, but a partially NCF. 

For y(xi, X2, X3, X4, X5), We find xi = or X3 = 1, the function will be reduced to 0, so we 
know Y = Xi(x3 © l)Yi. 

Where Yi = X2X4X5 © X2X4 © X4 © 1, for this function we find only when X4 = 0, Yi will be 
reduced to 1, so Yi = X4y2©l; where Y2 = X2X5©X2©1, and finally, we have I2 = X2(x5©l)©l, 
So Y is NCF with n = 5, r = 3 and ki = 2, /c2 = 1, k^ = 2, Mi = xi(x3 © 1), M2 = X4 and 
M3 = X2(x5 © 1), hence its Hamming weight is 5 by Equation \4.3\ and its average sensitivity is 
^by Equation \4.6[ 
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5. Conclusion 

We obtain a complete characterization for nested canalyzing functions (NCFs) by deriving 
its unique algebraic normal form (polynomial form). We introduced a new invariant, LAYER 
NUMBER for nested canalyzing function. So, the dominance of nested canalyzing variables is 
quantified. Consequently, we obtain the explicit formula of the number of nested canalyzing 
functions. Based on the polynomial form, we also obtain the formula of the Hamming weight 
of each NCF. The activity number of each variable of a NCF is also provided with an explicit 
formula. Consequently, we proved the average sensitivity of any NCF is less than 2, hence, we 
proved why NCF is stable theoretically. Finally, we conjecture that the tight upper bound for 
the average sensitivity of any NCF is |. 
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